Efficient Sparse Cholesky Factorization on a Massively Parallel SIMD Computer

Authors

  • Fredrik Manne
  • Hjálmtyr Hafsteinsson
Abstract

We investigate the effect of load balancing when performing Cholesky factorization on a massively parallel SIMD computer. In particular, we describe a supernodal algorithm for performing sparse Cholesky factorization. The way the matrix is mapped onto the processors has a significant effect on the algorithm's efficiency. We show that this assignment problem can be modeled as a graph coloring problem in a weighted graph. Using a simple greedy algorithm, we obtain substantial speedup compared with previously suggested data mapping schemes. Experimental runs have been made on a 16K processor MasPar MP-2 parallel computer using symmetric test matrices with irregular sparsity structure. On these problems our implementation achieves performance rates of well above 200 Mflops in double precision arithmetic.

Key words. sparse matrix algorithms, Cholesky factorization, systems of linear equations, parallel computing, data parallel algorithms, MasPar, graph coloring

AMS subject classifications. 05C50, 05C85, 15A23, 65F05, 65F50
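The abstract does not spell out the weighted coloring formulation, so the following is only a generic sketch of greedy graph coloring, not the authors' algorithm. It assumes each vertex carries a numeric weight, that heavier vertices should be colored first, and that adjacent vertices must receive different colors; the function name `greedy_weighted_coloring` and the sample graph are hypothetical.

```python
from collections import defaultdict


def greedy_weighted_coloring(weights, edges):
    """Greedy coloring: visit vertices by descending weight (an assumed
    ordering, not taken from the paper) and give each one the smallest
    color not already used by a colored neighbor."""
    adjacency = defaultdict(set)
    for u, v in edges:
        adjacency[u].add(v)
        adjacency[v].add(u)

    colors = {}
    # Ties in weight are broken by vertex label so the result is deterministic.
    for vertex in sorted(weights, key=lambda v: (-weights[v], v)):
        used = {colors[n] for n in adjacency[vertex] if n in colors}
        color = 0
        while color in used:
            color += 1
        colors[vertex] = color
    return colors


# Hypothetical example graph: a triangle a-b-c plus a pendant vertex d.
weights = {"a": 5, "b": 3, "c": 3, "d": 1}
edges = [("a", "b"), ("a", "c"), ("b", "c"), ("c", "d")]
coloring = greedy_weighted_coloring(weights, edges)
```

In a data mapping setting, one plausible reading is that vertices sharing a color can be handled without conflict, but how colors correspond to processors in the paper is not stated here.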

Related articles

Efficient Sparse Cholesky Factorization on a Parallel SIMD Computer

We investigate the effect of load balancing when performing Cholesky factorization on a SIMD computer. In particular we describe a supernodal algorithm for performing sparse Cholesky factorization. The way the matrix is mapped onto the processors has significant effect on its efficiency. We show that this assignment problem can be modeled as a graph coloring problem in a weighted graph. By a simple...

Highly Parallel Sparse Cholesky Factorization

We develop and compare several fine-grained parallel algorithms to compute the Cholesky factorisation of a sparse matrix. Our experimental implementations are on the Connection Machine, a distributed-memory SIMD machine whose programming model conceptually supplies one processor per data element. In contrast to special-purpose algorithms in which the matrix structure conforms to the connection s...

Scalable Parallel Algorithms for Solving Sparse Systems of Linear Equations

We have developed a highly parallel sparse Cholesky factorization algorithm that substantially improves the state of the art in parallel direct solution of sparse linear systems—both in terms of scalability and overall performance. It is a well known fact that dense matrix factorization scales well and can be implemented efficiently on parallel computers. However, it had been a challenge to dev...

A PERFORMANCE STUDY OF SPARSE CHOLESKY FACTORIZATION ON INTEL iPSC/860

The problem of Cholesky factorization of a sparse matrix has been very well investigated on sequential machines. A number of efficient codes exist for factorizing large unstructured sparse matrices, for example, codes from Harwell Subroutine Library [4] and Sparspak [7]. However, there is a lack of such efficient codes on parallel machines in general, and distributed memory machines in particul...

Parallel Sparse Cholesky Factorization with Spectral Nested Dissection Ordering

We show that the use of the spectral nested dissection (SND) ordering leads to an efficient parallel sparse Cholesky factorization on an Intel Paragon multiprocessor. Department of Computer Science, Old Dominion University, Norfolk, VA 23529-0162 and ICASE, NASA Langley Research Center, Hampton, VA 23681-0001 (pothen@cs.odu.edu, pothen@icase.edu). The research of the first and fourth author...

Journal:
  • SIAM J. Scientific Computing

Volume 16

Publication date: 1995